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ABSTRACT 

We present a general statistical framework for describing the effect of sample variance in the number 
counts of virialized objects and examine its effect on cosmological parameter estimation. Specifically, we 
consider effects of sample variance on the power spectrum normalization and properties of dark energy 
extracted from current and future local and high-redshift samples of clusters. We show that for future 
surveys that probe ever lower cluster masses and temperatures, sample variance is generally comparable 
to or greater than shot noise and thus cannot be neglected in deriving precision cosmological constraints. 
For example, sample variance is usually more important than shot variance in constraints on the equation 
of state of the dark energy from z < 1 clusters. Although we found that effects of sample variance on 
the us — Q m constraints from the current flux and temperature limited X-ray surveys are not significant, 
they may be important for future studies utilizing the shape of the temperature function to break the 
us — Q m degeneracy. We also present numerical tests clarifying the definition of cluster mass employed 
in cosmological modelling and accurate fitting formula for the conversion between different definitions 
of halo mass (e.g., virial vs. fixed overdensity) . 

Subject headings: 



1. INTRODUCTION 

Clusters of galaxies are the most massive and rare viri- 
alized objects in the universe. At the high mass end, their 
local abundance depends exponentially on the rms of mat- 
ter fluctuations and is thus a very sensitive probe of the 
power spectru m norm alizat ion and ma tter d ensity in the 
universe (e. g.,|Evrard| 1 989; |Frenk ct al.| 19 90; |Hcnry fc Ar " 



naudl 1991; iLiljel 1992; Jwhitc ct alj 1993; |Eke et alj 1996; 



Viana fc Lid"dkf l996; [Henry| 2000; pierpaoli eHQT 2001 



Large cluster masses and high intracluster gas tempera- 
tures make galaxy clusters observable in X-rays out to rel- 
atively high rcdshifts (z <J 1). Moreover, future Sunyaev- 
Zeldovich (SZ) surveys will provide homogeneous cluster 
samples with relatively sim ple selection functions out to 



3 (e.g., Holder et al 



2000). The SZ surveys can 
therefore be used to study evolution of cluster abundance 
over an unprecedented range of redshifts. The abundance 
of clusters above a certain mass in a given area of the sky 
as a function of redshift is very sensitive to the amplitude 
and growth rate of perturbations as well as the comoving 
volume per unit redshift and solid angle. These, in turn, 
are se nsitive to the cosmological parameters (e.g., Haimarj 
et alj 2001). Therefore, strong constraints on the cluster 



normalization, matter content, and the equation of state 
of the Universe can, in princ iple, b e obtained fr om large 



cluster surveys (Holder et al 



The potential of future c 



2001; Weller et alj 2001). 



uster surveys, however, can 



only be realized after careful studies of all possible sources 
of theoretical and observational errors and biases. First 
and foremost, it is critical to understand the relation be- 
tween observed and theoretical measures of cluster mass 
The mass-observable relations depend on the processes 
that shape the bulk properties of the intracluster medium 
(ICM). Extensive simulations of cluster formation have 
shed light onto the role of non-gravitational proce sses such 
as gal actic f eedback (e.g. , Metzler fc Evrard 1994; Navarro 
et all 1995; pialek et alj 2001; Holder & Carlstroml 2001) 



and radiative cooling (e.g., Muanwong et al. 2001). Both 
feedback and cooling appear to bring the predicted scaling 
relations between cluster X-ray luminosity, temperature, 
and mass in closer agreement with observations. However, 
both processes are notoriously difficult to incorporate into 
simulations and further numerical effort and detailed com- 
parisons with new high-quality Chandra, XMM-Newton, 
and SZ observations will be needed to test whether these 
processes alone can explain various properties of the clus- 
ter ICM and its e volution. A s was recently emphasized by 



Seljak (2002) and Ikebe et al. (2001), even the error budget 



on the local abundance is dominated by the uncertainties 
in conversion from theoretical to observable measures of 
mass. 

Next in importance is the relationship between the clus- 
ter abundance in mass and cosmology. Recently, the use of 
very large volume cosmological simulations has led to sig- 
nificant improvements in our knowledg e of t he theoretical 
cluster mass functio n flSheth fc Tor men| 1999; |Jenkins et al 



2001; |Evrard ct alj 2002; |Zhcng ct alj 2002). In particular, 



Jenkins ct alj (2001) found that the mass function at clus- 



ter scales can be described by a universal analytic fit for 
all cosmologies and redshifts to an accuracy of < 10 — 20% 



in amplitud e; this result was confirmed by Evrard et al 
(2002) and |Zhcng et af] (2002). While there is reason- 
able hope that further studies will result in even more 
accurate parameterizations of the mass function, even the 
current uncertainty is not the principle remaining com- 
ponent of the error budget in cosmological constraints. 



Evrard et al 



(2002) found that it is dominated by the 
uncertainties in conversion in mass definitions and sample 
(cosmic) variance. Indeed, the proliferation of different 
cluster mass definitions in theoretical analyses may be a 
source of considerable confu sion an d error in comparing 
predictions to observations ( White 2001). Finally, unlike 
the other sources of errors discussed above, sample vari- 
ance is usually neglected in theoretical and observational 
analyses. 
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In this paper, we present a statistical framework for de- 
scribing the sample variance for a cosmological population 
of virialized objects identified at a single epoch or over a 
range of redshifts. We apply this formalism to evaluate 
the effects of the sample variance on the current and fu- 
ture cosmological constraints, focusing on the specific cases 
of constraints from the local cluster temperature function 
and the evolution of the cluster abundance in future SZ 



surveys (e.g., Holder et al. 2000). We show that uncertain- 
ties due to sample variance are comparable to or greater 
than the Poisson errors in many cases of future interest. 
Therefore, the variance should not be neglected in anal- 
yses aiming to derive precision cosmological constraints. 
Although we found that effects of sample variance are not 
significant in the interpretation of the local cluster abun- 
dance from the current flux and temperature limited X-ray 
surveys to determine the power spectrum normalization, 
it may be important for futur e studies bas ed on the shape 
of the temperature function (Ikebe et al. 2001). We also 
present discussion and numerical tests clarifying the def- 
inition of mass in theoretical analyses and provide useful 
fitting formulas for conversions between different mass def- 
initions. 

The paper is organized as follows. In §|| we introduce 
the statistical formalism needed to describe sample vari- 
ance and its effect on cosmological parameter estimation. 
We discuss cluster scaling relations and survey selection 
in §|[ In §|| and §[| we discuss the impact of sample vari- 
ance on the interpretation of the local cluster abundance 
and future high redshift surveys dedicated to studying the 
properties of the dark energy. Finally, in a series of Ap- 
pendices, we give details of survey window calculations, 
numerical tests of the cluster mass function and bias and 
convenient fitting formulas for conversion between differ- 
ent mass definitions. 

2. STATISTICAL FORMALISM 

2.1. Sample Covariance 

Consider the general case of a population of objects se- 
lected for some property, for definiteness say their mass M 
in some range dM and assume that their number density 
fluctuation is related to the underlying linear mass field by 
a linear bias parameter 



Sn(M) = n(M)b(M)6. 



(1) 



The average number density within a set of windows 
Wi(xi) becomes 



= / d 3 XiWi(xi)[n + 8rii 



(2) 



where we assume that the windows are normalized so that 
J d 3 xWi — 1. The sample covariance of these estimates of 
the true number density becomes 



= b 2 / d 3 x t I cPxjWiixJWjixj) 



(3) 



= b 2 



d 3 k 



Wi(k)W*(k)P(k). 



Note that the large-scale structure of the universe makes 
the number density even in non-overlapping volumes co- 
vary. On top of this sample covariance one adds the usual 
shot noise variance which goes as 1/ nV fractionally. 

This expression is easily generalized to the two cases of 
interest where one wishes to extract cosmological informa- 
tion from the number density as a function of mass and/or 
redshift. In this general case, consider the covariance to 
be between windows at distinct redshifts and the selection 
to be for distinct masses, n,(Mj, Zi) and rij(Mj, Zj). Then 



(niUj) - riirij 



b{M l ,z i )b{M J ,z ] )D(z i )D{z 3 ) 



d 3 k 
(2tt) 3 



Wi(k)W*(k)P(k), (4) 



where D is the linear growth rate and we have again as- 
sumed a deterministic linear bias. 

We will assume the following meaning of the terms sam- 
ple covariance, sample variance and cosmic variance. For 
a given sample of objects, we will call the covariance be- 
tween its subsamples (e.g. Eqn. Q for subsamples of clus- 
ters in different mass or redshift bins) sample covariance. 
We will call the net effect of the covariance on a quantity of 
interest (e.g. a cosmological parameter) sample variance. 
Finally, for a sample that encompasses the entire observ- 
able universe, sample variance becomes cosmic variance. 

2.2. Survey Windows 

The calculation of the sample covariance relies on the 
description of the survey windows and the choice of bin- 
ning in redshift and in mass. A few simple cases are of 
interest for estimation purposes. Since in the considera- 
tion of windows, the distinction in Eqn. (^) of redshift and 
mass bins does not enter, we will consider the notationally 
simpler case of a single redshift and mass range in Eqn. (H) 
without loss of essential generality. 

For local surveys which cover a large fraction of the sky, 
it is often a reasonable approximation to take the window 
as a single spherically symmetric top hat volume of radius 
R. Then the variance in Eqn. (0) becomes 



,2\ 



= b 2 



'R ) 



(5) 



where an is the familiar top-hat rms of linear density fluc- 
tuations field. This quan tity is plotted for the fiducial 
ACDM cosmology of SpTl] in Fig. [|. Also plotted is the 



scaling of a white (or shot) noise power spectrum. Because 
the linear power spectrum is nearly flat for R « 100/i -1 
Mpc, sample variance and shot variance scale with volume 
in a similar fashion for typical volumes. The sampling er- 
rors on this scale exceeds 10% for a typical bias of a few. 

This basic example is readily generalized to the partially 
overlapping multiple spherical windows of a flux or mag- 
nitude limited selection and serves as a useful and simple 
order of magnitude estimate of sampling effects. 

For surveys confined to a smaller section of sky but ex- 
tending to cosmological distances, the windows can be di- 
vided up into slices in redshift. Consider a series of slices in 
redshift at comoving distance ri and width 8ri with a field 
of radius O s in radians in the small angle approximation 
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n the linear density 
field <Tjj in the fiducial ACDM cosmology (§3.1) as a function of top 
hat radius and the corresponding enclosed mass 
Because of the flatness of the linear power spectrum around R = 
100/i -1 Mpc, the rms scales in the same way as shot noise, V~ 1//2 oc 
R- 3 / 2 in this regime. The mass M„ at which cjr = 1.69 controls 
the relative strength of all effects (see Figure W) . 



and a flat spatial geometry: 

ik,,r< sinfcjj^/2 Jxikxr&s) 



Wi(k) = 2e 



(G) 



In the limit of slices that are thick compared with the 
coherence, 



{riirij) — n 



= Sij b 2 -j— ~2 



d 2 l fMlQs) 



(27T) 



10s 



P(i/n) . 



(7) 

Note that the angular window in the parenthesis goes to 
unity for 10 s <C 1. 

Eqn. (0) is clos ely rela ted to the Limber equation in 
Fourier space (e.g., Kaiser] 1992) in that there is no covari- 
ance between distinct redshift bins and only modes per- 
pendicular to the line-of-sight enter into the variance. We 
give a more general consideration of the survey window in- 
cluding sky curvature and a non-trivial angul ar m ask and 
radial selection in the Appendix |a| (see Eqn. \AA\). 



2.3. Statistical Forecasts 

While the calculation of the covariance in the previous 
sections can tell us about the relative importance of sam- 
ple and shot noise variance for various choices of mass 
and redshift binning, it does not directly translate into 
the relative importance on cosmological parameter esti- 
mation where the weighting of the bins reflects parameter 
sensitivity. 

Holder et al." (2001) introduced a useful Fisher matrix 
approximation to a Poisson likelihood analysis which es- 
sentially propagates the errors in the bins to the covariance 
of more fundamental parameters. Here we generalize the 
technique to include sample covariance. Define the Fisher 



matrix in the parameter space of interest p a as 



dpfj ' 



(8) 



where the covariance matrix includes both sample covari- 
ance and shot variance 



mrij + 5ijm/Vi 



(9) 



Then an estimate of the covariance matrix of the param- 
eters follows as 

C a(3 « (F- 1 )^ . (10) 

The Fisher approximation allows a rapid exploration of the 
parameter space(s) and the effect of survey specifications 
on statistical errors. It has been shown to be a reason- 
able app roximation for most cosmological parameters of 
interest (Holder et al. 2001). Note that the parameter 
set can include "nuisance parameters" , e.g. an amplitude 
and a power law index for an observable vs. mass relation. 
Utilizing the Fisher matrix approximation, these param- 
eters would be marginalized at the expense of increasing 
the errors 

(T a = (C aa ) 1/2 (11) 

on the parameters of interest. Marginalization can be off- 
set by prior knowledge in the form of an inverse covariance 
weighted sum of the information 

Ctot = (C- 1 + C^ or )- 1 , (12) 

or loosely speaking by summing the Fisher matrices from 
individual sources. Because an infinitely sharp prior on all 
parameters save an m element subset corresponds to con- 
sidering the reduced mxm Fisher sub-matrix, we term the 
corresponding errors on the parameters the "unmarginal- 
ized errors" . In the case of m = 1 parameter, the un- 
marginalized error is simply 1 / \JF aa . 

Finally, for the construction of priors and the interpre- 
tation of errors it is useful to note that under a re-para- 
mctcrization of the space to the set 71^ (p Q ), the covariance 
matrix transforms as 



(13) 



We will use this relation to evaluate the covariance matrix 
involving parameters that depend on the fundamental set, 
e.g. <7g and the Hubble constant. 

3. CLUSTER SURVEYS 

We now specialize the sample variance considerations to 
cluster surveys in the cont ext of spatially flat CDM mod- 
els with dark energy. In §3.1, we define the cosmological 
framework and the parameters of interest that under ly it. 
We provide further tests of the critical ingredients: the 
cluster mass function, m ass definition, and bias prescrip- 
tion in Appendix M. In §3.2, we relate cluster observables 



to halo mass, providing convenient conversion formulae 
between different definitions of halo mass in Appendix |c[ 
These relations are used to define common selection critc- 
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Fig. 2. — Mass function and bias in the fiducial ACDM cosmology 
[see Eqns. (hj) and (tq)] . Although the bias increases for M > 
M* and enhances sample errors, the exponential suppression of the 
number densities rapidly makes shot errors dominate at M> M». 
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Fig. 3. — M* controls the relative importance of sample and shot 
variance due to steeply declining number densities above M*. It 
is a strong function of redshift and an even stronger function of 
normalization. The fiducial ACDM model assumes erg = 0.92 at 
2 = and hence M*(z = 0) = 1.2 X 10 13 /i _1 Mpc. In conjunction 
with Eqn. fey), this figure can be used to determine the relative 
importance oTsample variance for cases not explicitly explored here. 



3.1. Cosmological Model 

We now specialize these considerations to cluster sur- 
veys. We associate galaxy clusters with dark matter halos 
of the same mass. The mean dif ferential como ving num- 
ber density of dark matter halos (Jenkins ct al. 2001, Eqn 
B3Q) 



dn 



d\nM 



0.301 



diner 



M dlnM 



exp[ 



lncT 



0.64| : 



and the linear bias ( [Mo fc White] 1997; |Sheth fe Tormcn| 
1999) 



b(M) = 1 + 



aS 2 /a 2 - 1 



2p 



5 c {l + (aS 2 /a 2 )P] 



(15) 



are modeled from fits to cosmological simulations with 
a = 0.75 and p = 0.3 (the values are modified to match 
the mass function for clusters in the Hubble volume simu- 
lations, R. Sheth, private communication). Here S c = 1.69 
is the threshold overdensity of spherical collapse in a mat- 
ter dominated universe; we will ignore the weak scaling of 
6 C with cosmology. er 2 (M, z) is the variance in the den- 
sity field smoothed with a top hat that encloses a mass M 
at the mean matter density today (or equivalently the co- 
moving density) p m . The correspondence in the adopted 
mass function and bias is to the mass enclosed in a region 
where the density is 180 times the mean matter density, as 
demonstrated in Appendix [b| The main cosmological scal- 
ing of both the number density and bias comes through a. 
It is useful then to scale the mass to ct(M») = S c . M*(z) 
is shown in Fig. 0. 

We consider the fiducial cosmology to be spatially flat 
and defined by 6 independent parameters: the dark en- 

1 We adopt the fit to the unsmoothed mass function of the halo 
catalogs with masses defined using the spherical overdensity of 180 
with respect to mean density. This fit describes the actual abun- 
dance of clusters in simulations; smoothing artificially increases the 
amplitude of the mass function by ~ 5 — 20% (Jenkins et al 2001) 
and is unwarranted at the high cluster masses that we consider here. 



ergy density f^DE = 0.65, equation of state w = — 1, 
physical matter density £l m h 2 — 0.148, physical baryon 
density fl^h 2 = 0.02, tilt n s — 1, and the initial normal- 
ization of the curvature fluctuations Sq = 4.79 x 10~ 5 at 
k = 0.01 Mpc" 1 to match the COBE detection. The non- 
standard definition of the normalization 5^ is given explic- 
itly in |Hrj (2001) and removes the cosmological parameter 
dependence of the more traditional choices. For the fidu- 
cial model, it converts to 8h — 4.42 x 10~ 5 , erg = 0.92, or 
M» = 1.2 x lO 13 /!" 1 ]^©. We shall see that all results are a 
strong function of the assumed normalization but can be 
rescaled through M* (see Fig. || and Eqn. (pl|)). 

3.2. Scaling Relations 

The mass of a cluster is usually not measured directly, 
but estimated from models of its X-ray luminosity, emis- 
sion weighted temperature Tx, velocity dispersion, or, in 
the future, i ts Sunyacv- ZePdovich integrated flux. 

Following tkebe et al. (2001), we will assume an Lx—Tx 
relation of 



L, = 1.23xl03?(^) 2 Vw m -. 



(16) 



Here Lx is the X-ray luminosity in the energy range of 
[0.1 — 2.4] keV and Tx is emission- weighted temperature 
of the intracluster gas. 

The connection between theoretical predictions and ob- 
servations is provided by the virial mass - temperature 
relation M v — Tx, which we parameterize as 



lO^/i-iM 







T 



x 



(l + z)(r> m A„) 1 /3 ikeV 



3/2 



,(17) 



where A„ is the virial ove rden sity with respect to the mean 
matter density (see Eqn. C6). (3 parameterizes the scaling 
and in general may be a function of Tx and cosmology. 

Nu merical simula tions by various groups give ~ 0.75 
(e.g lEvrard et all 1996; |Eke et al.| 1998; [Bryan fc Nor 



man 1998; also |Picrpaoli ct a" 2001 for a tabulation of 



5 



results), roughly independent of temperature and cos 
mology. Observatio ns favor a lower n ormalization (3 
0.54(T x /6keV) - 24 



(Finoguenov et al 



_ 2001) making the 
observed cluster sample less massive. Here we have con- 
verted from the mass at a much higher overdensity of 500 
times the critical density Af 5 oo/n m to M v . The observed 
relation consequent ly indi cates a lower normalization as in 
a fixed cosmology (Seljak 2002). We will employ the the- 
oretical M v — Tx relation to be consistent with our choice 
of cosmology: we will discuss effects of different M v — Tx 
relation in §pl Since the Jenkins mass function is defined 
at A = 180, not at A„, the virial M v — Tx relation must be 
further converted to an Miso — Tx relation to match with 
the mass function. These conversions can be done using 
simple but accurate fitting formulae for the gener al intcr- 
conversion of mass definitions assumingan NFW ( Navarro 
et al.[]l997) profile given in Appendix O. 

The relationship between M v and our M can be approx- 
imately parameterized as 



M 



W 15 h- 1 Mc 



(l + M 



10 15 /r 



VI Q 



l+b 2 



(18) 



for 14 < log(M/h Mq) < 16 and halo concentrations of 
the fiducial model with b\ — 0.168a: + 0.373a; 4 and 62 = 
0.0113a; + 0.0176a; 4 and x = 1 - Cl m (z). Note that in the 
matter-dominated limit the two mass definitions coincide. 

For the fiducial fi m = 0.35 cosmology the M — Tx rela- 
tion becomes 



M 



10 15 h~ 1 M, 







1.113/3 



(1 + z)(n m A v y/* ikev 



Because this 10% correction in temperature will propagate 
into a 50% correction of a flux-limited survey volume at 
a fixed mass, this seemingly small correction is required. 
Note however that the difference between simulation re- 
sults for f3 is at least comparable to this shift and the 
discrepancy between the theore tical fits to M — Tx in sim- 
ulations an d observed relatio n ( Hjorth et al. 1998; [Hornci 



Allen et al. 2001) is more substantial still 



et aljj.999; jNevalainen et al.| 2000; [Fmoguciiov et al.| 2001; 



3.3. Survey Selection 

The survey selection is a critical element in determining 
the importance of sample variance. Even ignoring errors 
in the conversion of observables to mass, which always 
decreases the relative importance of sample variance, the 
underlying cluster scaling relationships enter into the con- 
sideration of the survey selection and hence the survey 
window. A survey may be limited by X-ray flux, SZ flux, 
and/or temperature as well as volume or redshift and we 
will consider these as the relevant selection criteria. 

For the present purpose of determining the ultimate lim- 
itation sample variance places on cluster surveys, we take 
the scaling relations ( [l6| ) and ( |l^ ) with = 0.75 to be both 
correct and deterministic. In reality one should include in 
the final error budget the intrinsic scatter, measurement, 
and modeling uncertainties in the observables and their 
relation to mass. 

The M — Tx relation allows us to convert a temperature 
threshold, typically Tx ^> 1 — 3keV into a mass threshold 
M > M th . The combined M - T x and L x - T x relations 



convert mass to X-ray luminosity. For sample with a flux 
limit /iim, the effective radius of the volume probed by the 
survey is 

d L 1 



R = 



1 



1 



'MM) 

47T/] 



1 ira 



(20) 



which implies that the survey window in mass bins will be 
overlapping concentric spheres or sky-wedges that increase 
in radius with the mass. The sample variance calculation 
must then properly account for the covariance between 
the mass bins. Note that the survey volume V(M) oc 
^-15/4 anc j h ence strongly depends on the M — Tx relation 
adopted. This flux limit is typically placed on top of a 
volume (redshift) limit that defines, for example, a local 
sample of clusters. 

For SZ cluster surveys a flux decrement limit corre- 
sponds to d~^TgzM ~ const for a sufficiently large an- 
gular diameter distance oI a such that the clust er is within 
the effective field of view ( Holder fc Carlstrom 2001). Un- 
der the assumption that the gas density weighted electron 
temperature T$z ~ Tx, and the M — Tx relation above, 
an SZ flux limit corresponds to a mass threshold that is 
roughly constant with redshift and scales as M t h oc /3 3 / 5 
for different M — Tx relations. Note that clusters of a 
given mass are detectable to all relevant redshifts despite 
the flux limit and details of the M - T x and T x - T S z 
relations are much less important than in a flux limited X- 
ray survey. Nonetheless, the M — Tx relation has a severe 
effect on survey yields if the underlying power spectrum is 
normalized to an incorrect value by a misinterpretation of 
the local X-ray cluster abundance (see §||). 

4. LOCAL ABUNDANCE 

We consider here the effect of sample variance compared 
to shot noise and cosmological parameter degeneracy on 
the interpretation of th e local cluster abundance in the 
classic as — Qm plane (|Evrard| 1 989; Frcnk et al. 1990; 



Henry fe Arnaud| 1991; |Liljcj 1992; [White et alj l993). We 
begin with the idealized case of a volume and temperature 
limited survey and then consider the more realistic X-ray 
flux, volume and temperature limited case. 

4.1. Volume Limited Survey 

Let us start with a toy model to develop intuition for 
the effects of sample variance. Consider a volume limited 
local survey with a radius R that detects all clusters above 
a limiting mass Mth determined by a temperature limit 
and the M — Tx relation. 

First consider binning the clusters into a single mass bin 
M > M t h- There is then only one window to consider and 
we can directly compare the sample and shot noise errors 
on number density determinations. Note that sample vari- 
ance which is equal to shot variance leads to a y2 w 1.4 
increase in the errors. 

In Fig. ||, we show the sample a nd s hot errors for R = 
100, 200, 300 h^Mpc. As noted in §0 in the R = lOO/i" 1 
Mpc range both errors scale as V~ 1/z and we have removed 
this dependence by multiplying by (i?/250ft.~ 1 Mpc) 3 ^ 2 . 
The corresponding fiducial volume is 6.5 x 10 7 h~ 3 Mpc 3 , 
comparable to the volume of actual local surveys. No- 
tice that the two errors cross at M t h ~ 4 x 10 14 /i _1 Mq 
or Mth/ Mm ~ 30 with sample variance dominating at the 
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Fig. 4. — Sample vs. shot errors in number counts as a func- 
tion of threshold mass for a local survey. Sampling errors dominate 
at M < 30M* » 4 X 10 14 /i _1 M Q . Errors have been scaled by 
(R/250h~ -'-Mpc) 3 / 2 to reflect the local volume and the coincidence 
of the curves shows that both scale V~ 1//2 . Approximate results for 
alternate cosmologies or redshifts can be estimated by the scaling 
with M* (see Fig. g and Eqn. @). 



lower masses. This crossing mainly reflects the exponen- 
tial suppression of the number densities for Mth/-M* S> 1 
and hence the dramatic increase in shot noise. For small 
changes in redshifts, normalizations, and cosmologies these 
qualitative statement remain true but require rescaling. 
Roughly, 



M, 



th 



£31 



lid 



-1/2 



- - Ti 



0.3f 



1/2 



40(J 200 

0-8 



(21) 



for sample to dominate shot variance, where recall M»fid = 
1.2 x 10 3 /i _1 M Q . For a low T — Cl m h cosmology, the en- 
hancement of sample variance from the excess large scale 
power (at typical scales of R = 200/i -1 Mpc) from the 
shape of the power spectrum dominates and can substan- 
tially increase the importance of sample variance. In con- 
junction with Fig. ||, this scaling may be used as a rough 
check to determine whether sample variance is relevant to 
a given problem. For surveys that reach this limiting M t h, 
sample variance will dominate over shot noise in the total 
number density and lead to a substantial increase in the 
statistical errors. 

Now consider a survey that can also bin in masses, for 
example from individual temperature measurements. The 
relative contribution of sample variance will depend on 
what range of masses is most relevant to the question at 
hand. This sensitivity in turn can be a function of survey 
sel ectio n and cosmology. Here the Fisher matrix technique 
of §2.3 is useful since it automatically folds in these factors 
in a minimum variance weighting. 

As a simple example, consider the estimation of a single 
parameter p a with all others fixed by prior knowledge. The 
minimum variance weighting of the i mass bins is given by 
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Fig. 5. — Mass range sensitivity of the normalization determi- 
nation for surveys with a volume limit of 2 m ax and mass thresh- 
old of M = 10 14 h~ 1 Mq with and without an additional X-ray 
flux limit of f lim = 2 X 10" 14 W m 2 . With shot noise only, the 
normalization constraint gets its most significant contribution for 
3 - 4 X lO 14 h- 1 M (~ 3 keV) for the volume limit and 10 15 h- 1 M (s 
(~ 6 — 7 keV) for the additional flux limit. Correspondingly sam- 
pling errors are substantially more important for the former than 
the latter, also altering the optimal mass weighting (solid vs. dashed 
lines). 



For a diagonal covariance matrix as in the case of shot 
noise, this simply reduces to an inverse variance weighting 
of mass bins. 

For illustrative purposes, let us take a volume limit of 
z = 0.09 (R = 270k- 1 Mpc) and an all sky survey. We 
bin the masses from log(A///i _1 Af ) = 14 to 16 in 40 
steps of 0.05. The predicted mass function is sufficiently 
smooth that this binning more than suffices to recover all 
the information contained in its shape. Consider the pa- 
rameter of interest to be the power spectrum normaliza- 
tion 5(; or as- In Fig. 13, we show the weights assuming 
shot noise only (dashed lines) and shot plus sample co- 
variance (solid lines). The mass sensitivity peaks at about 
3 — 4 x 10 14 /i _1 M Q for shot noise only implying that the 
inclusion of sampling errors would produce a substantial 
change in the error budget. Indeed the Fisher approxima- 
tion yields a factor of 2 degradation in the errors on as 
or sample variance that is 3 times more important than 
shot variance. Other parameters, when also considered 
individually (unmarginalizcd) show similar error degrada- 
tions. Note that the optimal weighting in the presence of 
sample covariance shifts to higher masses and can have 
negative contributions due to the covariance of the bins 
so that neglect of sample covariance in the analysis can 
degrade errors even further. 

4.2. Flux Limited Sample 

More realistically, the sample of local clusters will be 
X-ray flux as well as volume and temperature limited so 
that low mass clusters are found only nearby. In this case, 
the mass sensitivity shifts to higher masses and sample 
variance becomes less important. 

For illustrative purposes, let us take a flux limit of 
/lim = 2 x 10 -14 W m -2 in addition to the volume limit 
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Fig. 6. — erg — f2 m constraints from a local X-ray flux and vol- 
ume limited survey at 65% CL (z max = 0.09, / Um = 2 X 10~ 14 
W m 2 ). Unshaded ellipses denote full shot and sampling errors 
while shaded ellipses denote shot errors only. Outer ellipses show 
marginalization over other parameters (except w) with very weak 
priors, inner ellipses show the unmarginalized constraints. The four 
sets of M th correspond to T x = 1.25,1.83,2.67, 3.9 keV with the 
fiducial M — Tx relation. The three higher choices of M t h have el- 
lipses shifted in steps of 0.2 in the y-direction for clarity. The effect 
of sample variance decreases with increasing parameter degeneracies 
and mass threshold. 



of z = 0.09 and mass threshold of Mth = lO^/i^M© 
(Tx = 1-25 keV), parameters typical for the current flux- 
limited sa mples of clust ers with estimated X-ray tempera- 



tures (e.g., [Ikebe et aL 2001). The mass weighting for the 
normalization parameter now peaks around 10 15 /i -1 Mq or 
temperatures around Tx = 6 — 7keV with only a small dif- 
ference if sample variance is included with shot variance. 
The degradation in normalization errors is consequently 
reduced to 20%. Different parameters have diffe ren t sen- 
sitivities. For the chosen parameter set (see §3.1), the 
relative degradation in the flux limited case is largest for 
£!de where it reaches 40%. Thus even for a flux limited 
survey, sample variance can be equal to shot variance in 
the statistical errors. 

The overall importance of sample variance can decrease 
due to other sources of errors in the estimation, includ- 
ing degeneracies in the parameters and errors in mass es- 
timation, especially at low masses or temperatures. To 
illustrate these effects, consider the usual two dimensional 
parameter space ag — Q m b y transforming our fundamental 
space as described in §2^3. In Fig. |(| we show the effect of 
sample variance on the 68% confidence region in this plane 
for several choices of the minimum mass (or temperature) 
in log(Mth//i _1 A^0) of the clusters employed, offset for the 
higher minimum masses for clarity. Larger ellipses rep- 
resent marginalization over other parameters (with very 
weak priors of <JQ, b h 2 = 0.01 and cr„ s = 0.3 mainly to elim- 
inate unphysical degeneracies) and smaller ellipses repre- 
sent unmarginalized constraints or prior-fixing of the other 
parameters. Both sets have a prior of a w = since the 
dark energy equation of state is highly degenerate with 

■ 

Not surprisingly, sample variance becomes irrelevant as 
the mass threshold is increased. Sample variance also de- 
creases in importance when going between the fixed and 



Fig. 7. — Shot and sampling errors on the number density n as a 
function of redshift for the deep (M th = lO 14 /!" 1 Mq, 12 deg 2 ) and 
wide configurations (M th = 2.5 X lO 14 /! -1 Mq, 4000 deg 2 ). For the 
deep survey, sampling errors dominate shot errors out to z ~ 0.9, 
whereas for the wide survey, they cross at z ~ 0.2 — 0.3. The 
importance of sample variance to cosmological constraints depends 
on their sensitivity in redshift relative to these crossover points. 
Approximate results for other cases of interest can be scaled from 
Fig. | and Eqn. (glj). 



marginalized cases due to the dominance of parameter 
degeneracies in the latter. For the Mth = 10 /i -1 Mg 
threshold, the increase in the errors on fl m go from 40% in 
the fixed case to 30% in the marginalized case. The effect 
of marginalization mainly comes through parameters that 
change the shape of the linear power spectrum. Realistic 
current constraints on these parameters places reality ap- 
proximately half-way in between the two extremes shown 
in Figure 0. Finally, note that the errors in the direction 
orthogonal to the degeneracy line increase negligibly with 
the addition of sample variance and threshold mass Mth- 
Since the local cluster ab undance is u sually used to con- 
strain this direction (c.f. Ikebe et"aT| 2001 for use of the 
degenerate direction) , its cosmological interpretation is es- 
sentially unmodified by sample variance or uncertainties at 
the low mass end. 



5. HIGH-Z SURVEYS 



The number abundance of high redshift clusters is widely 
recognized as being extraordinarily sensitive to cosmolog- 
ical parameters due to the exponential cut o ff in the mass 



function above M* 



Bartlctt] 1998; |Viana fc Liddlej 1999) 



Bahcall & Fan| 1998; planchard & 
Recently, its use 



in constraining the density and equation of state of the 
dark energy using planned Sunyaev-Zel'dovich ( SZ) clus- 
ter su rveys has been the focus of several studies ( Haiman 

These studies show that if 



et al 



2001; 



Weller et al 



2001). 



the shot noise of rare high redshift clusters were the only 
source of uncertainty, the SZ effect can constrain the equa- 
tion of state at the percent level. Here we consider the ef- 
fects of additional uncertainties from sample variance and 
parameter degeneracies for two classes of planned surveys: 
a deep but narrow survey volume and a wide but shallower 
survey. 

5.1. Fiducial Surveys 
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Fig. 8. — Dark energy constraints (68% CL) for the (a) deep and (b) wide high redshift cluster surveys. Panels right to left reflect the 
addition of priors from the base set of weak conservative priors to the addition of a local cluster abundance constraint to the further addition 
of sharp priors on all other parameters. Three different maximum redshifts are shown with and without the inclusion of sample variance 
errors. 



As discussed in §3.3, an SZ survey has the virtue of be- 
ing able to detect clusters of a sufficiently high mass out to 
all redshifts that they exist. In addition, the survey selec- 
tion can be described by an effective threshold ma ss Mth 
that depe nds only weak ly on redshift ( Holder et al.| 2000). 
Following Holder et al. (2001), we take as our fiducial sur- 
veys a 12 deg^ survey with a step function mass selection 
at M th = 10 14 /i _1 Mpc ("deep") and a 4000 deg 2 survey to 
M th = 2.5 x 10 14 /i _1 Mpc ("wide"). We take a minimum 
redshift of z = 0.09 to match the maximum redshift of the 
local surveys of the previous section and bin in redshift to 
Az = 0.1. The evolution of the number abundance is suffi- 
ciently smooth for the constant w models considered that 
finer binning is not needed, as we find no significant degra- 
dation of constraints when varying bin size up to Az = 0.1. 
This is comfortably larger than the expected accuracy of 
individual cluster redshifts estimat ed using photometric 



redshifts of member galaxies (e.g. Fernandez-Soto et al 
2002). We calculate the sample covariance under the win- 
dow approximation of Eqn. (^3j) in both cases. Since the 
redshift binning corresponds to scales of ~ 3000 Azh^ 1 
Mpc, the covariance is confined to a few neighboring red- 
shift bins but still should not be neglected. Likewise, the 
further Limber-like approximation of Eqn. (R) is not valid 
for the wide survey since line-of-sight modes contribute 
comparably to transverse modes for these large transverse 
scales. In addition, sky curvature begins to play a role 
for the wide survey and a more detailed anal ysis should 
employ the general window expression (Eqn. [A4|) given 
in Appendix We neglect this correction since we are 
mainly interested in the relative effects of sampling errors. 

In Fig. ^ we show the relative contribution of shot and 
sample errors as a function of the redshift of the bin. Be- 
cause of the low mass threshold of the deep survey, sam- 



pling errors dominate all the way out to z ~ 0.9. For the 
wide survey, the crossing occurs at z ~ 0.2 — 0.3 but sample 
variance remains important at z <J 0.7. The crossing point 
for other cases of interest may be scaled from Eqn. (21) 
and Fig. |^. Whether sample variance affects cosmological 
constraints depends on where in redshift the parameter 
sensitivity is maximized. 

Even though SZ surveys have no redshift limit, the selec- 
tion may still be redshift limited for two reasons: redshift 
followup and uncertainties in evolution. Since the SZ ef- 
fect is redshift independent it also gives no intrinsic handle 
on the redshift of clusters so discovered. Fortunately, the 
relative crudeness of the binning required (Az ~ 0.1) im- 
plies that photometric redshifts of cluster member galaxies 
should be more than sufficient. Still, photometric redshifts 
in the range 1 <; z <J 2.5 are difficult to obtain from optical 
ground-based observations. We shall see that even if red- 
shifts cannot be determined beyond a maximum redshift 
Zmax much of the cosmological information is recovered 
simply by binning all such clusters into a single high red- 
shift bin. More worrying is any evolution in cluster proper- 
ties that change M t h (or more generally the mass selection 
function) which cannot be accurately modeled from simu- 
lations or detailed multi- wavelength observations. In this 
case the clusters above a certain z max may be unusable for 
cosmological constraints. 



5.2. Dark Energy Constraints 

Let us now specialize these considerations to the ques- 
tion of constraining the density (^de) and equation of 
state (w) of the dark energy. Constraints in this plane and 
the relative importance of sample variance are far more 
sensitive to prior assumptions on other cosmological pa- 
rameters than the normalization from the local abundance. 
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Fig. 9. — Exploration of sensitivity of equation of state errors to assumptions in the wide survey and cosmology, (a) Maximum redshift 
with z > z m ax either absent in the survey catalog or binned into a single mass bin. (b) Mass threshold for a z ma x = 1 and (c) normalization 
of the linear power spectrum erg for z max = 1 and 3. Weak priors are assumed throughout. 



We choose three levels of prior knowledge: (1) Weak pri- 
ors that reflect a conservative view of current cosmological 
constraints = 0.065, <Jn b h 2 = 0.01, er Il3 = 0.3, c\ n £ = 2, 
(2) Weak priors plus a local cluster abundance constraint 
from ^ for the flux and volume limited case, and (3) Un- 
marginalized errors with the addition of the local cluster 
abundance constraint. 

In Fig. ||, we show the resulting constraints for the deep 
and wide surveys and maximum redshifts of z max — 0.7, 
1.0 and 3. For the deep surveys and weak priors, degra- 
dations in the errors on (f^DE, w) change slightly from 
(1.3,1.3) to (1.4,1.4) to (1.4,1.3) as z max increases. The 
degradation decreases [(1.3,1.3) to (1.1,1.4) to (1.1,1.2)] 
once the local abundance is added to a high z max make 
the constraints less dependent on the sample variance lim- 
ited low redshift end. Fixing other parameters reduces the 
effect still further. Note that the effect of priors on the ab- 
solute errors is much larger than on the relative effect of 
sample variance. For the wide survey the degradation de- 
creases from (1.3,1.4) to (1.1,1.2) as z max increases with 
weak priors and changes to (1.2,1.2) to (1.2,1.1) with the 
addition of the local abundance constraint. These smaller 
degradation factors reflects the overall decrease in the im- 
portance of sample variance with increasing mass thresh- 
old. In summary, sample variance causes a 10-40% degra- 
dation in dark energy constraints depending on the de- 
tailed assumption made. Since 40% reflects equal shot and 
sample variance contributions, sample variance cannot be 
ignored in the statistical error budget of future surveys. 
Moreover it can affect the optimal survey strategy given 
fixed observing constraints, e.g. making it more important 
to study rare high redshift clusters. 

Let us further explore these issues in the context of the 
wide survey, weak priors and the measurement of the equa- 
tion of state parameter w. Both the absolute constraints 
on w and the relative importance of sample variance are 
very sensitive to three quantities: the maximum redshift 
Zmax, the mass threshold M tn , and the fiducial value of the 
normalization erg (see Fig. ||). Note the sharp increase in 
the errors for z max < 1. If the maximum redshift simply 
reflects a lack of followup redshift information beyond this 
redshift, a fair fraction of the lost information can be re- 
gained just by binning all such clusters into a single high 
redshift bin. A low mass threshold at high redshift is also 
critical for obtaining strong constraints on w and also in- 
creases the relative importance of sample variance. Finally 



the assumed normalization of the cosmological model has 
a strong effect on the errors on w due to the lack of high 
redshift clusters as the normalization decreases. A low- 
ering of the normalization by a factor of 0.8 changes the 
errors on w by nearly a factor of 3. Lowering the normal- 
ization also reduces the importance of sample variance as 
clusters become rarer, especially at high redshift. 

6. DISCUSSION AND CONCLUSIONS 

The analyses presented in the preceding sections showed 
that for future surveys that probe ever lower cluster masses 
and temperatures, sample variance is generally compara- 
ble to or greater then the shot noise and thus cannot be ne- 
glected in deriving precision cosmological constraints. For 
example, sample variance typically doubles the statistical 
uncertainty in constraints on the dark energy equation of 
state in a SZ survey limited to z max < 1. Its relative im- 
portance is very weakly dependent on survey volume and is 
mainly a decreasing function of mass threshold of the sur- 
vey, M t h, compared with the mass of a typical halo, M*, 
because of the rarity of high mass clusters. We emphasize 
though that it is simply that the shot noise variance for 
future surveys is impressively small, and the cosmological 
prospects correspondingly bright, that the small absolute 
effect of sample variance plays any role at all! 

Although we found that effects of sample variance on the 
erg — constraints from the current flux and temperature 
limited X-ray surveys of local clusters are not significant, 
they may be important for future studies utilizing shape of 
the temperature function to break the erg — f2 m degeneracy, 
where sample variance typically increases the statistical 
variance by a factor of two. 

We have studied the effects of sample variance under the 
assumption that the fiducial ACDM cosmology is essen- 
tially correct and, most importantly, its joint normaliza- 
tion to the COBE results and cluster abundance assuming 
the usual simulation-based conversion of X-ray tempera- 
ture to mass. Recent improvements in the measurement 
of the observed mass-temperatur e relation has calle d the 
latter assum ption into question ( Finoguenov et al. 2001, 
Ikebe et ah] 2001). Because of the extraordinary sensitivity 
of cluster survey yields to the normalization of the power 
spectrum, these developments can significantly alter the 
prospects of cluster surveys including the relative impor- 
tance of s ampling variance. Employing the observed rela- 
tion from Finoguenov et aL] (2001) and fixing the number 
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of clusters at Tx > 6 keV in our idealized flux and volume 
limited surveys, we find that the normalization of the fidu- 
cial cosmology would be lowered to — 0.75 from 0.92, 
in agreement with conclusions of Seljak (2002). 

This change has opposite consequences for the impor- 
tance of sample variance for the local and high-redshift 
surveys. For the local surveys, since the number density 
in temperature is fixed, the decrease in the mass thresh- 
old, Mth increases the importance of sample variance (see 
Eqn. 1 21 1). For example, degradation in the unmarginal- 



ized errors on Ode increases from 1.4 to 1.6. Sample vari- 
ance effects would be even larger if the model is also tilted 
to match the COBE normalization. For the high-redshift 
surveys, the weak scaling of ma ss threshold with the mass- 
temperature relation (see § [O) and strong scaling of con- 
straints with normalization (see Fig. ^) make sample vari- 
ance less important compared to our analyses. More im- 
portantly, the change in the normalization relation would 
substantially degrade the ability of SZ surveys to measure 
dark energy properties due to the increase in the rarity of 
high redshift clusters. These relative changes^ remain true 
if the mass-temperature change indicates a higher dark en- 
ergy density Ode ~ 0.75, fl m h 2 = 0.13, with a er 8 = 0.85 
that then matches the COBE normalization as well as the 
Tx > 6 keV cluster abundance. 

This current ambiguity in predictions reflects the gen- 
eral point that until the mass-observable relationships for 
clusters are well understood with new observations and 
better simulations, the predicted cosmological yield of fu- 
ture cluster surveys and the statistical error budgets con- 
sidered here should be taken as highly uncertain. These re- 
lationships therefore should be the primary focus of the fu- 
ture modeling efforts and can be attacked in several ways. 
First, the use of higher resolution and more sophisticated 
cluster simulations together with new high-quality Chan- 
dra and XMM-Newton observations should fuel progress 
in our understanding of cluster scaling relations in the 
near future. The search for causes of discrepancy be- 
tween the observed and predicted M — Tx relations that 
currently compromises interpretation of the local cluster 
abundances, is already the subject of substantial observa- 
tional and modeling effort. At the same time, indepen- 
dent constraints on mass-observable relations can, in prin- 
ciple, be obtained from weak lensing analyses of large X- 
ray selected cluster samples. In addition, power spectrum 
normalization at cluster scales can be obt ained indepen- 
dently from studies of cosmic shear (e.g. Van Waerbekt 
et all] 2002). Independent measurement of eg would al- 
low use of local cluster temperature function to constrain 
present-day M — Tx relation. Although at higher redshifts 
prospects are less promising for the immediate future, deep 
X-ray and SZ follow-up observations to the SZ surveys 
should provide a wealth of data on such cluster popula- 
tions. These data will be critical for comparisons with 
numerical simulations, tests of the scaling relations evo- 
lution, and, ultimately, precision cosmological constraints 
from cluster surveys. 
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APPENDIX 

WINDOW FUNCTION 

We outline here the calculation in the case of a general 
survey window. Consider a general radial selection Ri(r) 
and angular mask 0(9, <fi) such that the set of windows 
satisfies the separability condition Wi(x) = Ri(r)0(9,4>). 
The Fourier transform of the window then becomes 

wi(k) = 4 7 r^H) i i? li (fc)e ;m r i m (^,0 fe ) (Ai) 



and (0k,4>k) define the direction of k with respect to a 
fiducial direction, e.g. the center of the window. Here the 
spherical harmonic transform of the angular mask is 



o lm = / dnrr (0,0)0(0,0), (A2) 

and the spherical Bessel transform of the radial window is 
Ru(k) = [ r 2 drji(kr)Ri(r). (A3) 



The covariance becomes 
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(A4) 

In the Limber approximation of a slowly varying P(k), 
one can use the identity 

k 2 dk 3l (fcr) j, (kr') =^6(r-r'), (A5) 

to show that the covariance is negligible 
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Further specializing to azimuthally symmetric windows 
with a normalized radial tophat profile, 
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= Sub 
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9|P(//r), (A7) 



where the angular window is 



0/ = 2tt / d cos 9Pi (cos 6)0{9) 



(A8) 



In the limit that I ^> 1 this expression agrees with the 
pillbox window Eqn. (0). Note that the angular windows 
in both expressions are normalized to unity for 10 s <C 1, 
where O s is the typical angular dimension of the window. 

NUMERICAL TESTS 

During the past several years there has been significant 
advances in our understanding of halo mass function and 
bias from simulations (e.g., Mo & White 1996; Shcth & 
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Fig. BIO. — Cluster mass functions derived from the simulation 
of a flat low-Qm CDM cosmology (fi m = 1 — ^de = T = 0.15; 
h = 0.65; n B = 1; erg = 1.07) using masses denned by Aiso and 
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A = 666.6. The error ba 
shows the fit obtained by Jenkins et al 

the dotted line shows this_ht with Aligo converted to M666r- 
scribed in the Appendix O Agreement indicates that the Lfenkins 



pisson errors. The solid line 
(2001, their Eqn. B3) while 
as da 



et alj^Jfit should be interpreted as Afigo, defined with respect to 
the mean density and that the mass function is universal for this 
definition of mass. The converted mass function fit match es both, 
our simulated mass function for A = 666 and the fit of Evrard 
et alJj2002, their Tab. 1) to the mass function from the Hubble 
volume ACDM simulation with this definition of mass. The agree- 
ment shows that conversion assuming an NFW profile adequately 
accounts for the substantial differences between Migo and Mg66- 



Tormen 1999; Jenkins et al. 2001). Nevertheless, there re- 
mains a certain confusion about the meaning of mass in the 
mass function expressions, sufficiently signi ficant to make 
analyses of cluster abundances ambiguous ([White] 2001). 
Furthermore, bias models were extensively tested against 
numerical simulations, b ut only fo r rela tively sma ll halo 

1997; |jrng 1999; |Shctb 



masses, M <; 10-20M* (Mo et al 



& Toimen 1999), which for our fiducial model corresponds 
to masses < 2.6 x 10 14 ft- _1 M . In cluster studies, espe- 
cially those at high redshift, halo masses M > 20M* must 
be interpreted and hence we test the bias model over the 
whole range. 

In this Appendix we use results of a large iV-body simu- 
lation to test the mass function and halo bias expressions 
used in our analyses. The simulation followed evolution 
of 256 3 w 1.6 x 10 7 particles in the flat CDM model with 
vacuum energy: Q m = 1 — J1de = 0.15; h = 0.65; n s = 1; 
cr 8 = 1.068; T = 0.15; L box = 512/1" 1 M©. Here T is 
the shape parameter in the adopted app roximation of the 



initial power spectrum (Bardeen et al. 1986). The sim 



ulation was run with the Adaptive Refinement Tree code 



to make our analysis a useful test of the universality of 
the mass function fit advocated by these authors. In ad- 
dition, differences between various common definitions of 
halo mass increase for lower values of n m . For example, 
consider the halo mass defined within the radius corre- 
sponding to the fixed overdensity with respect to mean 
density of the Universe, Mi 80 = (47r/3)i?f 80 Ai8oPm where 
Aiso = 180 independent of cosmology. It is is equivalent 
to the definition of mass with respect to the cosmology- 
specific virial overdensity given by the spherical collapse 
model, M v = (4Tr/3)R v Ayp 7n only for Q m = 1 because 
then Aiso = A w (see Eqn. |Cq ). For low fl m , Ai 80 and A„ 
are significantly different. For Q m = 0.15, for example, 
At, sa 532. This translates into sa 30% difference in mass 
for Mvir = 5 x lO 14 /^ 1 M (see Ap pendix | c]). T his cos- 



mology provides a sharp test of the Jenkins et al. (2001) 



assertion that the mass function of halos defined at Aiso 
is indepe nden t of cosmology. 

Figure BIO shows the halo mass functions derived from 
the simulation using masses defined for A iso and A — 
666.6. The latter definition was used by Evrard et al 



(2002) to get fits to the mass functions in the Hubble vol- 
ume simulations (it corresponds to their definition of mass 
at the overdensity of 200 with respect to the critical den- 
sity for Q m = 0.3: 200/0.3 ~ 666 .6). The solid line shows 
the fit obtained by Jenkins et al. (2001, their Eqn. B3 for 
this definition of mass^]), while the dotted line shows this 
fit with Afiso converted to Mqqq, as described in the Ap- 
pendix |c|. The figure clearly shows that halo mass in the 
universal Jenkins et al. fit should be interpreted as Miso, 
where the overdensity of 180 is defined with respect to the 
mean density. The converted mass function fit matches 
both our simulated mas s function for this overdensity and 
the fit of | vrard et alj (their Table 1). While it is clear 
that the differences in mass definition lead to substantial 
differences in t he amplitude and shape of the mass func- 
tion, Fig. B10 shows that these differences can be taken 
into acco unt b y the appropriate mass conversion. 

Figure Bll compares the average bias for halos of mass 
> M in simulation s to the pre dictions of analyt ic mod- 
els of |Mo fc White] (1996) and |Shcth fc Tormen| (1999). 
The bias in these models in linear regime is given by 
Eqn (HI) with (p, a) = (0,1) and (0.3,0.75), respectively. 
The analytic predictions were computed as a mass func- 
tion weighted average of the linear bias predictions, with 
the mass function fit of Jenkins et al. (2001). In the simu- 
lation, the bias was estimated as the average ratio of halo- 
mass and mass-mass correlation functions at linear scales: 
(b(> M)) = (6„n/£mm)iin. Namely, we estimated £ mm (r) 
and £hm(r,> M) for halos with masses greater than M 
(varying M over the entire probed range of masses; M 
here is defined for A = 180). The bias was estimated by 
averaging £,hm/£,mm over 20 — 30/i _1 Mpc where bias is 
scale-independent. This gives us an estimate of the mass 
function weighted bias as a function of halo mass. The 
figure shows that both models reproduce the mass depen- 
dence of halo bias for cluster masses. The value of the bias, 



flKra vtsov et al.| 1997)using 256 zcroth level grid and five nowever , is much better matched by the model of|§5eti 



levels [of refinement. The power; s^carum uomiali^Lioii, fc Tor ^ (1Q99) in all but the highest masg b^ 1 -^ 



a s , wL, chosen te rggfef? rgng tarrrts from bfas focai rinsta number of clusters in these bins is rather small (the last 



abundance (e.g., Pierpaoli et al. 2001) 



The adopted cosmological mod el is sufficientl y differ- 
ent from the models studied by Jenkins et al. (2001) 



3 Eqn. B3 gives fit 
what we plot in Figure 



t.n the unsmoothed mass function which is 
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Fig. Bll. — The average bia; 



for ha Ins nf 
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mass > Mi80 pre- 



Mo fe White (1996, das hfd UaA 



I ^hpth Sz Tormen (1999, solid line), as averaged over the Jenkins 
mass timction, and in the simulation (see Fig. plO|J 



dieted by t.h P analytic models of 
and [i^hpth fe Tormen (1999, solid line 
et a. 

estimated from the average ratio of halo-mass and mass-mass cor 
relation functions at linear scales: (£>(> M)) = {£,hml '?mm)iini as 
described in the text. The 2<r error bars are the Poisson errors in 
£.mm and £hm propagated to b. Note the points are not indepen- 
dent because they are derived from the same simulation and use 
overlapping cluster catalogs. 



mass bin contains only 3 clusters) and the deviations are 
not significant. The figure shows that Eqn. ( |T5| ) provides 
an accurate description of bias for the cluster mass halos. 

MASS CONVERSION 

Various definitions of the mass of a halo can be con- 
verted assuming a halo density profile. Here we provide a 
fit to the general scaling function that conv erts definitions 



under the assumption of a NFW profile (Navarro et al 
1997). 

The NFW halo profile has the functional form 

(r/r s )(l + r/r s y 
for which the mass enclosed at a radius is 

M h = 4 W 3 /(r s /r /l ) , (C2) 

where 

f{x)=x 3 [\n(l+x- 1 )-{l + x)- 1 }. (C3) 
The limiting behavior of this function is 

lim f(x) = -[1 + lnO)]a; 3 , 

lim f{x) = \x-\, (C4) 

x — >oo Z o 

which says that if 3> r s then up to a logarithmic cor- 
rection, the mass converges to that enclosed near the scale 
radius r s and if <^ r s the mass increases as r\ 



A common baseline definition of mass for comparison is 
the virial mass 



47rr 3 

M v = —^-A v p m , 



(C5) 



where the concentration parameter c = r v /r s and A„ is 
yirial overdensity w ith respect to the mean matter density 
( [Bryan fe Normarj 1998) 



18tt 2 + 82a; - 39x 2 
1 + x 



(C6) 



where x = Q m (z) — 1. We will give the conversion from 
an arbitrary definition of the halo mass to the virial mass 
although the procedure is completely general. 
Defining the halo mass as 



(C7) 



where can depend on cosmology but not on Mh, we 
relate the two radii as 



so that 



f(r s /r h ) = ^f(l/c), 



(C8) 



(C9) 



Converting between d efini tions of mass simply involves in- 
verting f(x) of Eqn. (C3) to find x{f) and hence an ex- 
plicit formula for the relationship between two different 
definitions of rh or Mh 



Mh 

M v 



Ah 



(CIO) 



The function f(x) is monotonic its single argument x 
and it is simple to form a numerical lookup table for its 
inverse. Nonetheless we here provide an accurate fitting 
formula for the inversion 



*(/) 



aif 



2p 



-1/2 



2/: 



(Cll) 



where p = a 2 + a^hi f + et^ln/) 2 and the 4 fitting 
parameters are (ai,. . . ,04) = (0.5116, —0.4283, —3.13 x 
10~ 3 ,— 3.52 x 10~ 5 ). This fit converts masses to better 
than 1% accuracy across concentrations < c < 400 
(2.5 < M v /}i~ 1 Mq < 00 in the fiducial model; for galaxy 
and cluster scales c < 20, the conversion has typical errors 
of ~ 0.3%). Note that the inversion is exact for c — > 
(/ — » 00) by construction. 

An alternate form that is exact in the limit c — > 00 can 
be obtained via Taylor expansion as 



Mh 

M v 



l + g(c, (A h /A v )^ 3 ) 



(C12) 



13 



where 

9{x> y) = ~3(a: + y)[x -xy+(l + x)(x + y)ln(l + ar) 
-(l + x)(a; + y) ln(l + x/y)] 
x[3(l + a;)(a; + y) 2 ln(l + .T) 
-x{x + Ax 2 + 6xy + 3y 2 )]- 1 , (C13) 



and for order unity y exceeds the accuracy of Eqn. (Cll) 
for c > 10. 

These formulas allow for a convenient ma pping; of M v 
Mh once c(M v ) is specified. Following 
(2001), we take 



Bullock et al 



c(M v ) = 9(1 + z)~ 1 (M v /M*)~ 



0.13 



(C14) 



where M* is evaluated at the present epoch z = 0. 
These formula also provide an accurate inverse relation 
Mh — > M v . Note that in the limit c(M v ) — > 00, the mass 
correction is small and in the limit c(M„) — + 0, it is in- 
dependent of c. Therefore, to obtain an accurate inverse 
mapp ing; one can utilize c(M^) in the inversion equation 
( C10 ) to obtain M v (Mh) to ~ 1% across all concentra- 
tions. For better accuracy one can iterate the procedure 
as c(M v {M h )). 
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